Promise and Limitations of Supervised Optimal Transport-Based Graph Summarization via Information Theoretic Measures

نویسندگان

چکیده

Graph summarization is the problem of producing smaller graph representations an input dataset, in such a way that compressed graphs capture relevant structural information for downstream tasks. One recent methods formulates optimal transport-based framework allows prior about node, edge, and attribute importance to be incorporated into process. However, very little known statistical properties this framework. To elucidate question, we consider supervised summarization, wherein by using theoretic measures seek preserve class label. gain theoretical perspective on itself, first formulate it terms maximizing Shannon mutual between summarized We show NP-hardness approximation result problem, thereby constraining what one should expect from proposed solutions. then propose method incorporates estimates random variables associated with sample labels transport compression empirically performance improvements over previous works classification accuracy time synthetic certain real datasets. also theoretically explore limitations approach fails satisfy desirable monotonicity property.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information Measures via Copula Functions

In applications of differential geometry to problems of parametric inference, the notion of divergence is often used to measure the separation between two parametric densities. Among them, in this paper, we will verify measures such as Kullback-Leibler information, J-divergence, Hellinger distance, -Divergence, … and so on. Properties and results related to distance between probability d...

متن کامل

Graph Hybrid Summarization

One solution to process and analysis of massive graphs is summarization. Generating a high quality summary is the main challenge of graph summarization. In the aims of generating a summary with a better quality for a given attributed graph, both structural and attribute similarities must be considered. There are two measures named density and entropy to evaluate the quality of structural and at...

متن کامل

Multi-Document Summarization Using Graph-Based Iterative Ranking Algorithms and Information Theoretical Distortion Measures

Text summarization is an important field in the area of natural language processing and text mining. This paper proposes an extraction-based model which uses graphbased and information theoretic concepts for multidocument summarization. Our method constructs a directed weighted graph from the original text by adding a vertex for each sentence, and compute a weighted edge between sentences which...

متن کامل

Information-theoretic Semi-supervised Metric Learning via Entropy Regularization

We propose a general information-theoretic approach to semi-supervised metric learning called SERAPH (SEmi-supervised metRic leArning Paradigm with Hypersparsity) that does not rely on the manifold assumption. Given the probability parameterized by a Mahalanobis distance, we maximize its entropy on labeled data and minimize its entropy on unlabeled data following entropy regularization. For met...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2023

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2023.3302830